Bayesian group latent factor analysis with structured sparsity

نویسندگان

  • Shiwen Zhao
  • Chuan Gao
  • Sayan Mukherjee
  • Barbara E Engelhardt
چکیده

Latent factor models are the canonical statistical tool for exploratory analyses of lowdimensional linear structure for an observation matrix with p features across n samples. We develop a Bayesian group factor analysis (BGFA) model that extends the factor model to multiple coupled observation matrices. Our model puts a structured Bayesian hierarchical prior on the joint factor loading matrix, which achieves shrinkage effect at both a local level (element-wise shrinkage) and a factor level (column-wise shrinkage) with non-parametric behavior that removes unnecessary factors. With two observations, our model reduces to Bayesian canonical correlation analysis (BCCA). We exploit the shrinkage behavior in the BGFA model to recover covariance structure across all subsets of the observation matrices where this signal exists. We validate our model on simulated data with substantial structure and compare recovered factor loadings against results from related methods. We then show the results of applying BGFA to two genomics studies for different analytic aims: identifying gene co-expression networks specific to one of two conditions, and recovering sets of genetic variants that jointly regulate transcription of a collection of genes. We illustrate the unique ability of BGFA to use multiple observations of the same samples to guide linear projection of the data onto a latent space, producing meaningful and robust lowdimensional representations, as compared with ‘unsupervised’ projections from traditional factor analysis or principal components analysis.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bayesian group factor analysis with structured sparsity

Latent factor models are the canonical statistical tool for exploratory analyses of lowdimensional linear structure for an observation matrix with p features across n samples. We develop a structured Bayesian group factor analysis model that extends the factor model to multiple coupled observation matrices; in the case of two observations, this reduces to a Bayesian model of canonical correlati...

متن کامل

Group Sparsity in Nonnegative Matrix Factorization

A recent challenge in data analysis for science and engineering is that data are often represented in a structured way. In particular, many data mining tasks have to deal with group-structured prior information, where features or data items are organized into groups. In this paper, we develop group sparsity regularization methods for nonnegative matrix factorization (NMF). NMF is an effective d...

متن کامل

Bayesian Structured Sparsity from Gaussian Fields

Substantial research on structured sparsity has contributed to analysis of many different applications. However, there have been few Bayesian procedures among this work. Here, we develop a Bayesian model for structured sparsity that uses a Gaussian process (GP) to share parameters of the sparsity-inducing prior in proportion to feature similarity as defined by an arbitrary positive definite ker...

متن کامل

Bayesian group latent factor analysis with structured sparse priors

Latent factor models are the canonical statistical tool for exploratory analyses of lowdimensional linear structure for an observation matrix with p features across n samples. We develop a Bayesian group factor analysis (BGFA) model that extends the factor model to multiple coupled observation matrices. Our model puts a structured Bayesian hierarchical prior on the joint factor loading matrix, ...

متن کامل

Dynamic Factor Volatility Modeling: A Bayesian Latent Threshold Approach

We discuss dynamic factor modeling of financial time series using a latent threshold approach to factor volatility. This approach models time-varying patterns of occurrence of zero elements in factor loadings matrices, providing adaptation to changing relationships over time and dynamic model selection. We summarize Bayesian methods for model fitting and discuss analyses of several FX, commodit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014